Abstract

We study a natural discrete Bochner-type inequality on graphs, and explore its
merit as a notion of “curvature” in discrete spaces. An appealing feature of this
discrete version of the so-called $\Gamma_2$-calculus (of Bakry-Émery) is that it
is fairly straightforward to compute this curvature parameter for several
specific graphs of interest: in particular, abelian groups, slices of the hypercube, and
the symmetric group under various sets of generators. We further develop this notion
by deriving Buser-type inequalities (à la Ledoux), relating functional and isoperimetric
constants associated with a graph. Our derivations provide a tight bound on the
Cheeger constant (i.e., the edge-isoperimetric constant) in terms of the spectral gap,
for graphs with nonnegative curvature, in particular the class of abelian Cayley graphs,
a result of independent interest.


1 Introduction 

For several decades now it has been a fruitful endeavour to translate notions from Riemannian
geometry to graph theory. It is now clear what the graph analogs of the Laplacian, the
Poincaré inequality, the Harnack inequality, and many related notions are. The graph point of
view led to generalizations which would have been less natural in Riemannian geometry, such as
$\beta$-parabolic Harnack inequalities (see, e.g., [5]), and to some counterexamples [4, 13, 22].
Despite all this progress, the graph analog of the notion of curvature remained elusive. In
their 1985 paper, Bakry and Émery [2] suggested a notion analogous to curvature that would
work in the very general framework of a Markov semigroup (which, of course, incorporates
both continuous diffusions and random walks on graphs). The condition was based on the
Bochner formula and was denoted by $CD(K,\infty)$ (for curvature-dimension), where $K$ is a
curvature parameter. A semigroup satisfying $CD(K,\infty)$ is a generalization of Brownian
motion on a manifold with Ricci curvature $\ge K$, and hence the condition $CD(K,\infty)$ is
often called simply “$\mathrm{Ric} \ge K$”; we will stick to this convention in this paper. This
notion as a possible definition of “Ricci curvature” in Markov chains was in fact considered
and discussed in [33] in 1999, but seems to have largely been neglected ever since. For
additional and more recent approaches to discrete Ricci curvature and related inequalities,
see [6, 16, 19, 27, 30, 32, 34]. The fact that one can conclude from positive (or negative)
curvature, a local property, global facts about the manifold has inspired similar
“local-to-global” principles in group theory; see, e.g., [18, 31].

* Research supported in part by the European Research Council.

† Research supported in part by the Israel Science Foundation and the Jesselson Foundation.

‡ Research supported in part by the NSF grant DMS-1407657.

Beyond lower bounds on curvature, the proofs in [2] (and in the recent book [3]) rely on 
two additional assumptions on the semigroup. The first was the existence of an appropriate 
algebra of smooth functions. The second was a chain-rule formula for the generator of the 
semigroup. A generator satisfying the latter assumption is called a diffusion operator, see [3, 
Definition 1.11.1, page 43]. In the continuous setting it is actually the existence of the required
algebra of smooth functions that is the most difficult condition to verify, but in graph settings 
this condition holds immediately. Nevertheless, the diffusion condition can never hold in the 
discrete setting. 

However, the diffusion condition is not always necessary. Denote the Cheeger constant
(sometimes known as the isoperimetric constant) by $h$ and the spectral gap by $\lambda$, and
recall the inequality of Buser [8], which states that a manifold with non-negative Ricci curvature
satisfies $\lambda \le 9h^2$ (exact definitions will be given in the next section). In 2006, the
first two authors noted that the arguments of Ledoux [23] allow one to derive a discrete
Buser-type inequality assuming only non-negative Ricci curvature.

Theorem 1.1. A graph satisfying $\mathrm{Ric} \ge 0$ satisfies $\lambda \le 16h^2$.

The graph version of Cheeger’s inequality (e.g. [1, 10]), which does not require positive
curvature, states that $\lambda \ge h^2/(2d)$, where $d$ is the maximum degree of the graph. Thus for
graphs with non-negative Ricci curvature and bounded degree we get that $\lambda \approx h^2$. As the
results from 2006 were never published, we include them in §4. A preprint of these results did
circulate and a number of papers built on it [6, 26]. Particularly relevant for us is the paper [26],
which shows that the eigenvalues of the Laplacian on a graph with positive curvature satisfy
$\lambda_k \le Ck^2\lambda_1$. In a similar spirit, we use the techniques of [23] to show a Gaussian-type
isoperimetric inequality for graphs satisfying $\mathrm{Ric} \ge 0$ (see Section 4.3 below).

In light of Theorem 1.1, an intriguing and challenging open problem is to characterize 
the class of graphs with non-negative Ricci curvature. The main new results of this paper 
are examples of such graphs which satisfy $\mathrm{Ric} \ge 0$. These include Cayley graphs of abelian
groups, the complete graph, the group $S_n$ with all transpositions, and slices of the hypercube.

In particular, we get Buser’s inequality for any Cayley graph of a finite abelian group. We
remark that this is not true for a general group. For example, the Cayley graph of the group
$S_n$ with the generators $\{(12), (12\ldots n)^{\pm 1}\}$ has $h$ of order $1/n^2$ and $\lambda \ge 1/n^3$, up to an
absolute constant (we fill in some details about these well-known facts in §2.3). This should be
compared against the fact that any compact Lie group has positive Ricci curvature, see [9,
Corollary 3.19, page 65].

Note that our results above translate to $\lambda(M) \le 16\,d\,h^2(M)$ for a simple random walk
$M$ on an abelian Cayley graph, regular of degree $d$, with $h(M)$ and $\lambda(M)$ being defined for
the Markov chain version. A result of the above type was also recently derived independently
by Erbar and by Oveis-Gharan and Trevisan (private communications). An earlier, weaker
result, $\lambda(M) = O(d^2h^2(M))$, follows from the work in [6], which uses a different notion of
curvature (and a different argument of Ledoux), starting from a finite-dimensional
curvature-dimension $CD(K,n)$ inequality for graphs.

Recently there have been several attempts to modify the $CD(K,n)$ criterion in order
to allow certain results involving the heat equation [6, 21, 29]. A recent result of Münch
[29] is that the $CDE'(K,n)$ criterion of [21] implies the $CD(K,n)$ criterion of Bakry-Émery.
These criteria are often useful; for example, it is known that Ricci-flat graphs satisfy both
the $CDE(0,\infty)$ criterion of [6] and the $CDE'(0,\infty)$ criterion.

In the remainder of this section, we introduce the Bochner-type curvature for graphs
along with various notations and definitions. In Section 2, we bound the curvature for
several examples, including slices of the discrete cube and the symmetric group with adjacent as
well as all transpositions as the generating sets, and we prove nonnegativity of curvature for
Cayley graphs of abelian groups. In Section 3, we show that the spectral gap can be bounded from
below by curvature. In Section 4, we derive the above-mentioned Buser-type inequalities.


1.1 Preliminaries 

We first recall some basic definitions and fairly standard notions. Let G = (V, E) be an 
undirected and locally finite graph. Throughout, we will assume that G has no isolated 
vertices. The graph Laplacian is $\Delta = \Delta(G) = -(D(G) - A(G))$, where $D(G)$ is the diagonal
matrix of the degrees of the vertices, and $A(G)$ is the adjacency matrix of $G$. As an operator,
its action on an $f : V \to \mathbb{R}$ can be described as:

$$\Delta f(x) = \sum_{y \sim x} \big(f(y) - f(x)\big),$$

where here and below the notation $y \sim x$ means that $y$ is a neighbour of $x$ in the graph.
The sum is of course only over the $y$. Note that $\Delta$ is a negative semi-definite matrix.

The spectral gap $\lambda(G)$ is the least non-zero eigenvalue of $-\Delta$. We define the Cheeger
constant

$$h(G) = \min_{0 < |A| \le |V|/2} \frac{|\partial A|}{|A|},$$

where $|\partial A|$ denotes the number of edges from $A$ to $V \setminus A$.
Given functions $f, g : V \to \mathbb{R}$, we also define:

$$\Gamma(f,g)(x) = \frac{1}{2} \sum_{y \sim x} \big(f(x) - f(y)\big)\big(g(x) - g(y)\big).$$

When $f = g$, the above becomes the more commonly denoted (square of the $\ell_2$-type) discrete
gradient: for each $x \in V$,

$$\Gamma(f)(x) := \Gamma(f,f)(x) = \frac{1}{2} \sum_{y \sim x} \big(f(x) - f(y)\big)^2 =: |\nabla f(x)|^2.$$
It becomes useful to define the iterated gradient

$$2\Gamma_2(f,g) = \Delta\Gamma(f,g) - \Gamma(f,\Delta g) - \Gamma(\Delta f, g).$$

By convention,

$$\Gamma_2(f) := \Gamma_2(f,f) = \frac{1}{2}\Delta\Gamma(f) - \Gamma(f,\Delta f).$$

Note that, given a measure $\pi : V \to [0,\infty)$, one can consider the expectation (with respect
to $\pi$) of $\Gamma(f,g)$, which gives us the more familiar Dirichlet form associated with
a graph:

$$\mathcal{E}(f,g) := \frac{1}{2} \sum_{x} \sum_{y \sim x} \big(f(x) - f(y)\big)\big(g(x) - g(y)\big)\,\pi(x).$$

It is useful to note an identity:

$$\sum_{x \in V} \Gamma(f,g)(x) = -\sum_{x \in V} f(x)\,\Delta g(x) = -\sum_{x \in V} g(x)\,\Delta f(x). \qquad (1)$$

An additional useful local identity is:

$$\Delta(fg) = f\,\Delta g + 2\Gamma(f,g) + g\,\Delta f. \qquad (2)$$
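The definitions and the two identities above are easy to check by machine. The following is a minimal sketch in Python, not part of the original text: the function names `laplacian` and `gamma`, the choice of the 5-cycle, and the two test functions are all illustrative assumptions. Exact rational arithmetic is used so that the identities can be compared with `==`.

```python
from fractions import Fraction

def laplacian(adj, f):
    """(Delta f)(x) = sum over neighbours y of x of (f(y) - f(x))."""
    return {x: sum(f[y] - f[x] for y in adj[x]) for x in adj}

def gamma(adj, f, g):
    """Gamma(f, g)(x) = 1/2 * sum_{y ~ x} (f(x) - f(y)) * (g(x) - g(y))."""
    return {x: Fraction(sum((f[x] - f[y]) * (g[x] - g[y]) for y in adj[x]), 2)
            for x in adj}

# Hypothetical test data: the 5-cycle with two arbitrary integer functions.
n = 5
adj = {i: [(i - 1) % n, (i + 1) % n] for i in range(n)}
f = {0: 3, 1: -1, 2: 4, 3: 1, 4: -5}
g = {0: 2, 1: 7, 2: -1, 3: 8, 4: 2}

lap_f, lap_g = laplacian(adj, f), laplacian(adj, g)
lap_fg = laplacian(adj, {x: f[x] * g[x] for x in adj})
gam_fg = gamma(adj, f, g)

# Identity (2), checked pointwise: Delta(fg) = f Delta g + 2 Gamma(f, g) + g Delta f.
identity2_holds = all(
    lap_fg[x] == f[x] * lap_g[x] + 2 * gam_fg[x] + g[x] * lap_f[x] for x in adj)

# Identity (1): sum_x Gamma(f, g)(x) = -sum_x f(x) Delta g(x) = -sum_x g(x) Delta f(x).
identity1_holds = (sum(gam_fg.values())
                   == -sum(f[x] * lap_g[x] for x in adj)
                   == -sum(g[x] * lap_f[x] for x in adj))
```

Both flags should come out true for any finite graph and any pair of functions, since (1) and (2) are algebraic identities.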


Definition 1.1. The (Bochner) curvature $\mathrm{Ric}(G)$ of a graph $G$ is defined as the maximum
value $K$ so that, for any function $f$ and vertex $x$, we have

$$\Gamma_2(f)(x) \ge K\,\Gamma(f)(x). \qquad (3)$$


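Let $x \in V$, and let $f : V \to \mathbb{R}$ be a function. Observe that (3) is unchanged on adding a
constant to $f$, so we may assume that $f(x) = 0$. We expand $\Gamma_2(f)(x)$:

$$\begin{aligned}
2\Gamma_2(f)(x) &= \Delta\Gamma(f)(x) - 2\Gamma(f,\Delta f)(x) \\
&= \sum_{v \sim x} \Gamma(f)(v) - d(x)\,\Gamma(f)(x) - \sum_{v \sim x} f(v)\big(\Delta f(v) - \Delta f(x)\big) \\
&= \frac{1}{2} \sum_{u \sim v \sim x} \big(f(u) - f(v)\big)^2 - \frac{d(x)}{2} \sum_{v \sim x} f^2(v)
   + \sum_{v \sim x} f(v) \sum_{v \sim x} f(v) - \sum_{u \sim v \sim x} f(v)\big(f(u) - f(v)\big) \\
&= \Big(\sum_{v \sim x} f(v)\Big)^2 - \frac{d(x)}{2} \sum_{v \sim x} f^2(v)
   + \frac{1}{2} \sum_{u \sim v \sim x} \big(f^2(u) - 4f(u)f(v) + 3f^2(v)\big) \\
&= \Big(\sum_{v \sim x} f(v)\Big)^2 - \sum_{v \sim x} \frac{d(x) + d(v)}{2}\, f^2(v)
   + \frac{1}{2} \sum_{u \sim v \sim x} \big(f(u) - 2f(v)\big)^2.
\end{aligned} \qquad (4)$$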
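As a sanity check on the expansion (4), one can compare it with $\Gamma_2$ computed directly from the definition. The sketch below does this on a small irregular graph with one triangle. The helper names `lap`, `gam`, `gam2`, the edge list and the test function are illustrative assumptions, not taken from the text; the function is normalised so that $f(x) = 0$ at the base vertex.

```python
from fractions import Fraction

def lap(adj, f, x):
    """(Delta f)(x) = sum_{y ~ x} (f(y) - f(x))."""
    return sum(f[y] - f[x] for y in adj[x])

def gam(adj, f, g, x):
    """Gamma(f, g)(x) = 1/2 * sum_{y ~ x} (f(x) - f(y)) * (g(x) - g(y))."""
    return Fraction(sum((f[x] - f[y]) * (g[x] - g[y]) for y in adj[x]), 2)

def gam2(adj, f, x):
    """Gamma_2(f)(x) = 1/2 * Delta Gamma(f)(x) - Gamma(f, Delta f)(x)."""
    ball = [x, *adj[x]]                      # x and its neighbours suffice
    lap_f = {v: lap(adj, f, v) for v in ball}
    gam_f = {v: gam(adj, f, f, v) for v in ball}
    return lap(adj, gam_f, x) / 2 - gam(adj, f, lap_f, x)

# Hypothetical example graph (contains the triangle 0-1-2) and function.
edges = [(0, 1), (0, 2), (1, 2), (1, 3), (2, 4), (3, 4)]
adj = {v: [] for v in range(5)}
for a, b in edges:
    adj[a].append(b)
    adj[b].append(a)
f = {0: 0, 1: 3, 2: -1, 3: 4, 4: 2}
x = 0

lhs = 2 * gam2(adj, f, x)
# Right-hand side of (4); the last sum runs over v ~ x and u ~ v, including u = x.
rhs = (sum(f[v] for v in adj[x]) ** 2
       - sum(Fraction(len(adj[x]) + len(adj[v]), 2) * f[v] ** 2 for v in adj[x])
       + Fraction(sum((f[u] - 2 * f[v]) ** 2 for v in adj[x] for u in adj[v]), 2))
```

For this particular choice both sides equal 46, which can also be confirmed by hand.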


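Now, we break the latter term into the cases that $u = x$, $u \sim x$ and $d(x,u) = 2$. In the second
case, we denote by $\triangle(x,v,u)$ the set of all unordered pairs $(u,v)$ satisfying $x \sim u \sim v \sim x$.
The above is equal to

$$\begin{aligned}
2\Gamma_2(f)(x) &= \frac{1}{2} \sum_{\substack{u \sim v \sim x \\ d(x,u) = 2}} \big(f(u) - 2f(v)\big)^2
   + \Big(\sum_{v \sim x} f(v)\Big)^2 + \sum_{v \sim x} \Big(2 - \frac{d(x) + d(v)}{2}\Big) f^2(v) \\
&\quad + \frac{1}{2} \sum_{\triangle(x,v,u)} \Big[\big(f(v) - 2f(u)\big)^2 + \big(f(u) - 2f(v)\big)^2\Big] \\
&= \frac{1}{2} \sum_{\substack{u \sim v \sim x \\ d(x,u) = 2}} \big(f(u) - 2f(v)\big)^2
   + \Big(\sum_{v \sim x} f(v)\Big)^2 + \sum_{v \sim x} \frac{4 - d(x) - d(v)}{2}\, f^2(v) \\
&\quad + \sum_{\triangle(x,v,u)} \Big[2\big(f(v) - f(u)\big)^2 + \frac{1}{2}\big(f^2(v) + f^2(u)\big)\Big].
\end{aligned} \qquad (5)$$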


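Fixing $f(v)$ for all vertices $v \sim x$, we may ask what choice of $f(u)$ (for $d(x,u) = 2$) minimizes
the above expression. We wish to minimize

$$\frac{1}{2} \sum_{v:\, x \sim v \sim u} \big(f(u) - 2f(v)\big)^2;$$

since a sum of squared deviations is minimized at the mean, it is simple to see that the minimizer is

$$f(u) = \frac{2}{r(u)} \sum_{v:\, x \sim v \sim u} f(v), \qquad (6)$$

where $r(u)$ is the number of common neighbors of $u$ and $x$.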

We first prove a general upper bound on the above notion of curvature, which will be 
used in the next section, to show tightness of our bounds on curvature for several example 
graphs. 

Theorem 1.2. Let $G = (V,E)$ be a graph. If $e \in E$, let $t(e)$ denote the number of triangles
containing $e$. Define $T := \max_e t(e)$. Then $\mathrm{Ric}(G) \le 2 + \frac{T}{2}$.

Proof. Let $x \in V$ be any vertex with the minimum degree $d$, and consider the distance (to
$x$) function $f(v) = \mathrm{dist}(v,x)$. It is simple to calculate that

$$2\Gamma_2(f)(x) = d^2 + \sum_{v \sim x} \Big(2 - \frac{d + d(v)}{2}\Big) + \sum_{\triangle(x,v,u)} 1 \le 2d + \frac{dT}{2},$$

observing that

$$|\triangle(x,v,u)| = \frac{1}{2} \sum_{v \sim x} t(\{x,v\}) \le \frac{dT}{2}$$

and that $\Gamma(f)(x) = \frac{1}{2}d$. Any value of $K > 2 + \frac{T}{2}$ will not satisfy (3) for the function $f$ at
vertex $x$, thus $\mathrm{Ric}(G) \le 2 + \frac{T}{2}$. □
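To see the test function from this proof in action, the sketch below evaluates the ratio $\Gamma_2(f)(x)/\Gamma(f)(x)$ for the distance function on two regular graphs, the complete graph $K_4$ and the 3-dimensional hypercube, and compares it with $2 + T/2$. The helper names (`lap`, `gam`, `gam2`, `max_triangles`) and the choice of graphs are illustrative assumptions. On a regular graph in which every edge lies in exactly $T$ triangles, the inequalities in the proof become equalities, so the ratio should match the bound exactly.

```python
from fractions import Fraction
from itertools import product

def lap(adj, f, x):
    return sum(f[y] - f[x] for y in adj[x])

def gam(adj, f, g, x):
    return Fraction(sum((f[x] - f[y]) * (g[x] - g[y]) for y in adj[x]), 2)

def gam2(adj, f, x):
    """Gamma_2(f)(x) = 1/2 * Delta Gamma(f)(x) - Gamma(f, Delta f)(x)."""
    ball = [x, *adj[x]]
    lap_f = {v: lap(adj, f, v) for v in ball}
    gam_f = {v: gam(adj, f, f, v) for v in ball}
    return lap(adj, gam_f, x) / 2 - gam(adj, f, lap_f, x)

def max_triangles(adj):
    """T = max over edges {a, b} of the number of common neighbours of a and b."""
    return max(len(set(adj[a]) & set(adj[b])) for a in adj for b in adj[a])

# Complete graph K_4 with the distance to vertex 0.
k4 = {i: [j for j in range(4) if j != i] for i in range(4)}
dist_k4 = {i: 0 if i == 0 else 1 for i in range(4)}

# 3-dimensional hypercube with the distance to the origin (Hamming weight).
cube = {v: [v[:i] + (1 - v[i],) + v[i + 1:] for i in range(3)]
        for v in product((0, 1), repeat=3)}
dist_cube = {v: sum(v) for v in cube}
origin = (0, 0, 0)

ratio_k4 = gam2(k4, dist_k4, 0) / gam(k4, dist_k4, dist_k4, 0)
ratio_cube = gam2(cube, dist_cube, origin) / gam(cube, dist_cube, dist_cube, origin)
bound_k4 = 2 + Fraction(max_triangles(k4), 2)
bound_cube = 2 + Fraction(max_triangles(cube), 2)
```

Here `ratio_k4` and `bound_k4` both equal 3, and `ratio_cube` and `bound_cube` both equal 2.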
2 Examples

In this section we provide bounds on the curvature for several graphs of general interest. 


2.1 The hypercube $H_n$

Let $H_n$ represent the $n$-dimensional hypercube, where vertices are adjacent if their Hamming
distance is one. While the following result also follows from the tensorization result of [33],
we provide here a direct proof.

Theorem 2.1. $\mathrm{Ric}(H_n) = 2$ if $n \ge 1$.

Proof. For any vertex $x \in H_n$, and for any $f$ with $f(x) = 0$, we get from (5)

$$2\Gamma_2(f)(x) = \frac{1}{2} \sum_{u:\, d(x,u) = 2}\ \sum_{v:\, x \sim v \sim u} \big(f(u) - 2f(v)\big)^2
   + \Big(\sum_{v \sim x} f(v)\Big)^2 + (2 - n) \sum_{v \sim x} f^2(v).$$

Let $u$ be a vertex of distance 2 from $x$, and let $v$ and $w$ be the two distinct vertices so
that $x \sim v, w \sim u$. Then for fixed values of $f(v)$ where $v \sim x$, according to (6)
$\Gamma_2(f)(x)$ is minimized by $f(u) = f(v) + f(w)$. With this value,

$$\big(f(u) - 2f(v)\big)^2 + \big(f(u) - 2f(w)\big)^2 = 2\big(f(v) - f(w)\big)^2.$$

As for every pair $v, w \sim x$ there is a unique vertex $u$ with $u \sim v, w$ and $d(x,u) = 2$,

$$2\Gamma_2(f)(x) \ge \sum_{\substack{v,w \sim x \\ v \ne w}} \big(f(v) - f(w)\big)^2
   + \Big(\sum_{v \sim x} f(v)\Big)^2 + (2 - n) \sum_{v \sim x} f^2(v),$$

where the first sum is over all unordered pairs $(v,w)$ of distinct neighbors of $x$. We use this
convention throughout the paper. Expanding the above gives

$$\begin{aligned}
&\sum_{\substack{v,w \sim x \\ v \ne w}} \big(f^2(v) + f^2(w)\big) - 2\sum_{\substack{v,w \sim x \\ v \ne w}} f(v)f(w)
   + \sum_{v \sim x} f^2(v) + 2\sum_{\substack{v,w \sim x \\ v \ne w}} f(v)f(w) + (2 - n)\sum_{v \sim x} f^2(v) \\
&\qquad = 2\sum_{v \sim x} f^2(v) = 4\Gamma(f)(x).
\end{aligned}$$

So $\mathrm{Ric} \ge 2$, and by Theorem 1.2 we may conclude that $\mathrm{Ric} = 2$. □
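The bound $\Gamma_2(f) \ge 2\,\Gamma(f)$ on the hypercube can be probed numerically. The following sketch, under the assumption that a handful of random integer-valued functions on $H_4$ is a reasonable spot check, evaluates $\Gamma_2(f)(x) - 2\Gamma(f)(x)$ at every vertex; it also checks that the Hamming weight attains equality at the origin. The helper names and the fixed random seed are illustrative choices, and a spot check of this kind is of course no substitute for the proof.

```python
import random
from fractions import Fraction
from itertools import product

def lap(adj, f, x):
    return sum(f[y] - f[x] for y in adj[x])

def gam(adj, f, g, x):
    return Fraction(sum((f[x] - f[y]) * (g[x] - g[y]) for y in adj[x]), 2)

def gam2(adj, f, x):
    """Gamma_2(f)(x) = 1/2 * Delta Gamma(f)(x) - Gamma(f, Delta f)(x)."""
    ball = [x, *adj[x]]
    lap_f = {v: lap(adj, f, v) for v in ball}
    gam_f = {v: gam(adj, f, f, v) for v in ball}
    return lap(adj, gam_f, x) / 2 - gam(adj, f, lap_f, x)

n = 4
cube = {v: [v[:i] + (1 - v[i],) + v[i + 1:] for i in range(n)]
        for v in product((0, 1), repeat=n)}

rng = random.Random(0)  # fixed seed, so the spot check is reproducible
worst_gap = None
for _ in range(20):
    f = {v: rng.randint(-5, 5) for v in cube}
    gap = min(gam2(cube, f, x) - 2 * gam(cube, f, f, x) for x in cube)
    worst_gap = gap if worst_gap is None else min(worst_gap, gap)

# The Hamming weight (distance to the origin) attains equality at the origin.
weight = {v: sum(v) for v in cube}
origin = (0,) * n
equality_gap = gam2(cube, weight, origin) - 2 * gam(cube, weight, weight, origin)
```

A non-negative `worst_gap` is what the theorem predicts, and `equality_gap` vanishing shows the constant 2 cannot be improved.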


In the following, we compute the curvature of the complete graph. With the tensorization
result of [26], this provides another proof of the fact that the hypercube has curvature 2.
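2.2 The complete graph $K_n$

Theorem 2.2. $\mathrm{Ric}(K_n) = 1 + \frac{n}{2}$ if $n \ge 2$.

Proof. For the complete graph on $n$ vertices, we have, for every $x \in V$ and every $f : V \to \mathbb{R}$
such that $f(x) = 0$, from (5),

$$2\Gamma_2(f)(x) = \Big(\sum_{v \sim x} f(v)\Big)^2 + (3 - n)\sum_{v \sim x} f^2(v)
   + \sum_{\substack{u,v \sim x \\ u \ne v}} \Big(2\big(f(u) - f(v)\big)^2 + \frac{1}{2}\big(f(u)^2 + f(v)^2\big)\Big).$$

Expanding the above gives

$$\begin{aligned}
&\sum_{v \sim x} f^2(v) + \sum_{\substack{u,v \sim x \\ u \ne v}} 2f(u)f(v) + (3 - n)\sum_{v \sim x} f^2(v)
   + \frac{5}{2}\sum_{\substack{u,v \sim x \\ u \ne v}} \big(f^2(v) + f^2(u)\big) - \sum_{\substack{u,v \sim x \\ u \ne v}} 4f(u)f(v) \\
&\quad = (4 - n)\sum_{v \sim x} f^2(v) + \frac{5}{2}(n - 2)\sum_{v \sim x} f^2(v) - 2\sum_{\substack{u,v \sim x \\ u \ne v}} f(u)f(v) \\
&\quad = \Big(\frac{3n}{2} - 1\Big)\sum_{v \sim x} f^2(v) - 2\sum_{\substack{u,v \sim x \\ u \ne v}} f(u)f(v)
   = \frac{3n}{2}\sum_{v \sim x} f^2(v) - \Big(\sum_{v \sim x} f(v)\Big)^2.
\end{aligned}$$

By the Cauchy-Schwarz inequality, $\big(\sum_{v \sim x} f(v)\big)^2 \le |\{v : v \sim x\}| \sum_{v \sim x} f^2(v) = (n - 1)\sum_{v \sim x} f^2(v)$,
so

$$\frac{3n}{2}\sum_{v \sim x} f^2(v) - \Big(\sum_{v \sim x} f(v)\Big)^2 \ge \Big(1 + \frac{n}{2}\Big)\sum_{v \sim x} f^2(v).$$

Thus $\mathrm{Ric} \ge 1 + \frac{n}{2}$; once again by Theorem 1.2, we conclude that $\mathrm{Ric} = 1 + \frac{n}{2}$. □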


2.3 Finite abelian Cayley graphs 

A finite abelian group is of course a product of cyclic groups and hence one might think that 
the curvature of the graph can be deduced from the tensorization result of [26]. However, 
a Cayley graph is determined by an underlying group and a generating set for that group. 
Here we show that a finitely generated abelian group with any set of generators has non-negative
Ricci curvature, not only with the generating set inherited from a decomposition into cyclic
groups. This result was implicit in the literature, since abelian Cayley graphs are “Ricci
flat” [12], and this property, in turn, gives $\mathrm{Ric} \ge 0$ [25]. We give here a direct proof.

Let us remark that the problem of graphs locally identical to an abelian group has also
been attacked successfully using combinatorial tools. See [7] and references within.

Theorem 2.3. Let $X$ be a finitely generated abelian group, and $S$ a finite set of generators
for $X$. Let $G$ be the Cayley graph corresponding to $X$ and $S$. Then $\mathrm{Ric}(G) \ge 0$.
Recall that the Cayley graph of a group $G$ with respect to a given set $S$ which generates
$G$ is the graph whose vertices are the elements of $G$ and whose edges are $\{(g, gs)\}_{g \in G,\, s \in S}$.
Since we are interested in undirected graphs, $S$ should be symmetric, i.e. $s \in S \Rightarrow s^{-1} \in S$.

Proof. Without loss of generality, we may set $x$ to be the identity element of $X$. Denote the
degree of every vertex by $d$. As usual, let $f : G \to \mathbb{R}$ with $f(x) = 0$.

For this calculation, we prefer not to distinguish between vertices $u$ according to their distance
from $x$, so we start the calculation from (4) and, using the constant degree, get

$$2\Gamma_2(f)(x) = d\sum_{v \sim x} f^2(v) + \Big(\sum_{v \sim x} f(v)\Big)^2
   + \sum_{v \sim x}\sum_{u \sim v} \Big(\frac{f^2(u)}{2} - 2f(u)f(v)\Big). \qquad (7)$$

Because $x$ is the identity, we observe that if $u \sim v \sim x$, there is a unique $w \sim x$ so that
$u = vw$. We can express the last term of (7) as

$$\begin{aligned}
\sum_{v \sim x}\sum_{u \sim v} \Big(\frac{f^2(u)}{2} - 2f(u)f(v)\Big)
&= \sum_{v \sim x}\sum_{w \sim x} \Big(\frac{f^2(vw)}{2} - 2f(vw)f(v)\Big) \\
&= \sum_{v \sim x} \Big(\frac{f^2(v^2)}{2} - 2f(v^2)f(v)\Big)
   + \sum_{\substack{v,w \sim x \\ v \ne w}} \Big(f^2(vw) - 2f(vw)\big(f(v) + f(w)\big)\Big) \\
&\ge -2\sum_{v \sim x} f^2(v) - \sum_{\substack{v,w \sim x \\ v \ne w}} \big(f(v) + f(w)\big)^2
   = -(d + 1)\sum_{v \sim x} f^2(v) - 2\sum_{\substack{v,w \sim x \\ v \ne w}} f(v)f(w).
\end{aligned}$$

In the last passage we used the elementary inequalities $a^2/2 - 2ab \ge -2b^2$ and $a^2 - 2ab \ge -b^2$.
Plugging this bound into (7), we find that

$$2\Gamma_2(f)(x) \ge \Big(\sum_{v \sim x} f(v)\Big)^2 - \sum_{v \sim x} f^2(v) - 2\sum_{\substack{v,w \sim x \\ v \ne w}} f(v)f(w) = 0.$$

This completes the proof. □
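The point of the theorem is that the generating set is arbitrary. As an illustration, the sketch below builds the Cayley graph of the cyclic group $\mathbb{Z}_9$ with the symmetric generating set $\{\pm 1, \pm 3\}$ (written additively as 1, 8, 3, 6) and spot-checks $\Gamma_2(f)(x) \ge 0$ for random integer-valued functions. The group, the generators, the helper names and the seed are all hypothetical choices made for this example.

```python
import random
from fractions import Fraction

def lap(adj, f, x):
    return sum(f[y] - f[x] for y in adj[x])

def gam(adj, f, g, x):
    return Fraction(sum((f[x] - f[y]) * (g[x] - g[y]) for y in adj[x]), 2)

def gam2(adj, f, x):
    """Gamma_2(f)(x) = 1/2 * Delta Gamma(f)(x) - Gamma(f, Delta f)(x)."""
    ball = [x, *adj[x]]
    lap_f = {v: lap(adj, f, v) for v in ball}
    gam_f = {v: gam(adj, f, f, v) for v in ball}
    return lap(adj, gam_f, x) / 2 - gam(adj, f, lap_f, x)

# Cayley graph of Z_9 (additive notation) with the symmetric generating set {1, 8, 3, 6}.
n, gens = 9, [1, 8, 3, 6]
adj = {g: [(g + s) % n for s in gens] for g in range(n)}

rng = random.Random(1)  # fixed seed for reproducibility
functions = [{g: rng.randint(-5, 5) for g in range(n)} for _ in range(20)]
min_gamma2 = min(gam2(adj, f, x) for f in functions for x in adj)
```

A non-negative `min_gamma2` is consistent with $\mathrm{Ric} \ge 0$; swapping in other symmetric generating sets is a one-line change.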

Now, the assumption that the group is abelian is necessary. An infinite example demonstrating
this is the $d$-regular tree, which is the Cayley graph of the group
$\langle s_1, \ldots, s_d : s_i^2 = \mathrm{id} \text{ for } i = 1, \ldots, d \rangle$ with the generating set $s_1, \ldots, s_d$.
This graph has $\mathrm{Ric} = 2 - d$, which is achieved whenever $\sum_{y \sim x} f(y) = 0$ and $f(z) = 2f(y)$
whenever $z \sim y \sim x$ and $d(x,z) = 2$. This is optimal;
it is not difficult to see that no $d$-regular graph has $\mathrm{Ric}(G) < 2 - d$.
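The extremal function described here can be checked on the ball of radius 2 in the 3-regular tree, which is all that $\Gamma_2(f)(x)$ depends on. In the sketch below the vertex labels, the helper names and the specific values $1, -1, 0$ on the neighbours of the root are illustrative assumptions; vertices at distance 2 carry only the edge back to their parent, which does not affect the value at the root.

```python
from fractions import Fraction

def lap(adj, f, x):
    return sum(f[y] - f[x] for y in adj[x])

def gam(adj, f, g, x):
    return Fraction(sum((f[x] - f[y]) * (g[x] - g[y]) for y in adj[x]), 2)

def gam2(adj, f, x):
    """Gamma_2(f)(x) = 1/2 * Delta Gamma(f)(x) - Gamma(f, Delta f)(x)."""
    ball = [x, *adj[x]]
    lap_f = {v: lap(adj, f, v) for v in ball}
    gam_f = {v: gam(adj, f, f, v) for v in ball}
    return lap(adj, gam_f, x) / 2 - gam(adj, f, lap_f, x)

d = 3
root = "x"
values = [1, -1, 0]            # values on the neighbours of the root; they sum to zero
adj = {root: [("y", i) for i in range(d)]}
f = {root: 0}
for i in range(d):
    children = [("z", i, j) for j in range(d - 1)]
    adj[("y", i)] = [root] + children   # full degree d at distance 1
    f[("y", i)] = values[i]
    for z in children:
        adj[z] = [("y", i)]             # truncated: only the edge back to the parent
        f[z] = 2 * values[i]            # f(z) = 2 f(y) at distance 2

ratio = gam2(adj, f, root) / gam(adj, f, f, root)
```

With these values $\Gamma(f)(x) = 1$ and $\Gamma_2(f)(x) = -1$, so `ratio` equals $2 - d = -1$.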

A little more surprising, perhaps, is that the Heisenberg group also has negative curvature.
We mean here the group of upper triangular matrices with 1 on the diagonal and integer
entries, equipped with the set of generators

$$\left\{ \begin{pmatrix} 1 & \pm 1 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix},\ 
\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & \pm 1 \\ 0 & 0 & 1 \end{pmatrix} \right\}.$$

It is straightforward to check that these generators do not satisfy any relation of length 4,
so the environment within distance 2 (which is the only relevant distance for calculation of
the curvature) is tree-like, and the curvature would be $-2$.






Switching to finite Cayley graphs, it is well-known that there exist finite Cayley graphs 
which are locally tree-like, and hence would have negative curvature. What is perhaps more 
interesting is that even Buser’s inequality (the conclusion of Theorem 4.2) may fail. 

Theorem 2.4. For the group $S_n$ and the (left) Cayley graph generated by $\{(12), (12\ldots n)^{\pm 1}\}$,
the Cheeger constant is $\le c_1 n^{-2}$, while the spectral gap is $\ge c_2 n^{-3}$, with $c_1, c_2 > 0$
independent of $n$.

Proof sketch. To show an upper bound on the Cheeger constant, we consider the following
set:

$$A = \{\phi \in S_n : \mathrm{dist}(\phi(1), \phi(2)) \le \tfrac{1}{4}n\}$$

(there is no connection between the 1 and 2 in the definition of $A$ and the fact that we took
(12) as a generator). Here dist is the cyclic distance between two numbers in $\{1, \ldots, n\}$, i.e.
$\min(|x - y|, n - |x - y|)$. Clearly $|A| = (\frac{1}{2} + o(1))\,n!$. To calculate the size of the boundary we
first note that the generators $(12\ldots n)^{\pm 1}$ keep $A$ invariant, so the boundary of $A$ is composed
of edges between $\phi \in A$ and $(12)\phi \notin A$. This makes two requirements on $\phi$: first it must
satisfy $\mathrm{dist}(\phi(1), \phi(2)) = \lfloor \frac{1}{4}n \rfloor$, and second it must satisfy that one of $\phi(1), \phi(2)$ is in the
set $\{1, 2\}$, otherwise the application of (12) does nothing to $\phi(1)$ and $\phi(2)$ and $(12)\phi$ would
still be in $A$. Thus $|\partial A| \approx n!/n^2$ and $h \le c/n^2$ (this argument gives $c = 2 + o(1)$).

The estimate of the spectral gap (from below) for the random walk on this Cayley graph
was done by Diaconis and Saloff-Coste (see Section 5.3 in [14]), as an example of the
comparison argument: comparing with the random transposition chain, which has a spectral
gap of order $1/n$, gives a lower bound of $(1/10)\,n^{-3}$ for this chain; since the graph has
bounded degree, the spectral gap of the graph Laplacian is only a constant factor off that of
the random walk on the graph.

For the convenience of the reader, and for completeness, we now sketch a proof of a
lower bound of $1/(n^3 \log n)$, which serves to justify the point of the theorem. We construct a
coupling between two lazy random walkers on our group $S_n$ that succeeds by time $n^3 \log n$.
It is well-known (see e.g. [24]) that this bounds the mixing time, and hence the relaxation
time, which is the inverse of the spectral gap. The coupling is as follows: assume $\phi_n$ and
$\psi_n$ are our two walkers. We apply exactly the same random walk steps to both walkers except in
one case: when for some $i$, $\phi_n(i) = 1$ and $\psi_n(i) = 2$. In this case when we apply a (12) step for
$\phi_n$ we apply a lazy step to $\psi_n$, and vice versa (the $(12\ldots n)^{\pm 1}$ are still applied together).
It is easy to check that for each $i$, $\phi_n(i) - \psi_n(i)$ is doing a random walk on $\{1, \ldots, n\}$, slowed
down by a factor of $n$, with gluing at 0. Therefore it glues with positive probability by time
$n^3$ and with probability $> 1 - 1/(2n)$ by time $Cn^3 \log n$. Thus by this time, by a union bound
over $i$, with probability $> \frac{1}{2}$ we have $\phi(i) = \psi(i)$ for all $i$, or in other words, the coupling
succeeded. This shows that the mixing time is $\le Cn^3 \log n$ and in turn gives a lower bound on
the spectral gap. □

2.4 Cycles and infinite path 

We consider the cycle $C_n$ for $n \ge 3$. We extend the notation by letting $C_\infty$ denote the infinite
path.

From previous results it is simple to observe that $\mathrm{Ric}(C_3) = \frac{5}{2}$, as $C_3 = K_3$, and that
$\mathrm{Ric}(C_4) = 2$, because $C_4 = H_2$.

Theorem 2.5. If $n \ge 5$, $\mathrm{Ric}(C_n) = 0$.

Proof. We note that the calculation of $\mathrm{Ric}(G)$ at $x$ requires us to consider only the subgraph
consisting of those vertices $v$ with $d(x,v) \le 2$, and those edges incident to at least one
neighbor of $x$.

If $n \ge 5$, this subgraph will always be a path of length 4 centered at $x$, so we only need
to calculate the curvature for this graph. $C_n$ is an abelian Cayley graph, thus $\mathrm{Ric} \ge 0$.

$\mathrm{Ric} = 0$ is achieved by the function $f$ that takes values $-2, -1, 0, 1, 2$ in order along the
path. □

Corollary 2.6. Let $\mathbb{Z}^d$ represent the infinite $d$-dimensional lattice. $\mathrm{Ric}(\mathbb{Z}^d) = 0$.

We simply note that $\mathbb{Z}^d$ is the product of $d$ copies of $C_\infty$.

2.5 Slices of the hypercube 
2.5.1 $k$-slice with transpositions

For some fixed value $k$ with $1 < k < n$, let $G = (V,E)$ be the graph with
$V = \{x \in \{0,1\}^n : \sum_i x_i = k\}$, and $x \sim y$ whenever $|\mathrm{supp}(x - y)| = 2$.

Theorem 2.7. This graph has curvature $\mathrm{Ric} = 1 + \frac{n}{2}$.

Proof. Let $x \in V$. Define $s_{ij}x$ to be the vertex obtained by exchanging coordinates $i$ and $j$
in $x$. A vertex $u$ with $d(x,u) = 2$ will be $u = s_{ij}s_{lm}x$ for some distinct coordinates $i,j,l,m$
with $x_i = x_l = 1$, $x_j = x_m = 0$. Vertices $v$ with $x \sim v \sim u$ are $s_{ij}x$, $s_{lm}x$, $s_{im}x$, $s_{lj}x$. Observe
that

$$\sum_{v:\, x \sim v \sim u} \big(f(u) - 2f(v)\big)^2 \ge 2\big(f(s_{ij}x) - f(s_{lm}x)\big)^2 + 2\big(f(s_{im}x) - f(s_{lj}x)\big)^2.$$

Summing over all vertices $u$ with $d(x,u) = 2$ gives

$$\frac{1}{2} \sum_{\substack{x \sim v \sim u \\ d(x,u) = 2}} \big(f(u) - 2f(v)\big)^2 \ge \sum_{\substack{v,w \sim x \\ \nexists\triangle(x,v,w)}} \big(f(v) - f(w)\big)^2,$$

as for each pair $v, w \sim x$ with $v \not\sim w$, there is exactly one $u$ with $v, w \sim u$ and $d(x,u) = 2$.
(Here we use the notation $\nexists\triangle(x,v,w)$ to denote the set of unordered pairs $(v,w)$ of distinct
neighbors of $x$ for which $v \not\sim w$.)

Also notice that any $v \sim x$ has $t(\{x,v\}) = n - 2$: if $v = s_{ij}x$, the vertices that make a
triangle with $x$ and $v$ are $s_{lj}x$ when $l \ne i$ and $x_l = x_i$, and $s_{im}x$ when $m \ne j$ and $x_m = x_j$.
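Now we may compute

$$\begin{aligned}
2\Gamma_2(f)(x)
&\ge \sum_{\substack{v,w \sim x \\ \nexists\triangle(x,v,w)}} \big(f(v) - f(w)\big)^2 + \Big(\sum_{v \sim x} f(v)\Big)^2
   + \Big(2 - d + \frac{n - 2}{2}\Big)\sum_{v \sim x} f(v)^2 + 2\sum_{\triangle(x,v,w)} \big(f(v) - f(w)\big)^2 \\
&\ge \sum_{v,w \sim x} \big(f(v) - f(w)\big)^2 + \Big(\sum_{v \sim x} f(v)\Big)^2 + \Big(1 - d + \frac{n}{2}\Big)\sum_{v \sim x} f(v)^2 \\
&= (d - 1)\sum_{v \sim x} f(v)^2 - 2\sum_{v,w \sim x} f(v)f(w) + \sum_{v \sim x} f(v)^2 + 2\sum_{v,w \sim x} f(v)f(w)
   + \Big(1 - d + \frac{n}{2}\Big)\sum_{v \sim x} f(v)^2 \\
&= \Big(1 + \frac{n}{2}\Big)\sum_{v \sim x} f(v)^2.
\end{aligned}$$

So $\mathrm{Ric}(G) \ge 1 + \frac{n}{2}$. Together with Theorem 1.2 we get that $\mathrm{Ric} = 1 + \frac{n}{2}$. □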
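For a concrete instance, the sketch below builds the slice with $n = 5$, $k = 2$ (ten vertices, each of degree $k(n-k) = 6$) and spot-checks $\Gamma_2(f) \ge (1 + \frac{n}{2})\,\Gamma(f)$ for random integer-valued functions. It also evaluates the ratio for the distance function, which on this graph is half the Hamming distance. The parameters, helper names and seed are illustrative assumptions.

```python
import random
from fractions import Fraction
from itertools import combinations

def lap(adj, f, x):
    return sum(f[y] - f[x] for y in adj[x])

def gam(adj, f, g, x):
    return Fraction(sum((f[x] - f[y]) * (g[x] - g[y]) for y in adj[x]), 2)

def gam2(adj, f, x):
    """Gamma_2(f)(x) = 1/2 * Delta Gamma(f)(x) - Gamma(f, Delta f)(x)."""
    ball = [x, *adj[x]]
    lap_f = {v: lap(adj, f, v) for v in ball}
    gam_f = {v: gam(adj, f, f, v) for v in ball}
    return lap(adj, gam_f, x) / 2 - gam(adj, f, lap_f, x)

def hamming(x, y):
    return sum(a != b for a, b in zip(x, y))

n, k = 5, 2
verts = [tuple(1 if i in c else 0 for i in range(n)) for c in combinations(range(n), k)]
adj = {x: [y for y in verts if hamming(x, y) == 2] for x in verts}
K = 1 + Fraction(n, 2)          # claimed curvature 1 + n/2

rng = random.Random(4)          # fixed seed for reproducibility
worst_gap = min(
    gam2(adj, f, x) - K * gam(adj, f, f, x)
    for f in ({v: rng.randint(-5, 5) for v in verts} for _ in range(20))
    for x in verts)

# Distance to a base vertex x0: half the Hamming distance in this graph.
x0 = verts[0]
dist = {v: hamming(x0, v) // 2 for v in verts}
equality_ratio = gam2(adj, dist, x0) / gam(adj, dist, dist, x0)
```

Since the graph is regular and every edge lies in $n - 2 = 3$ triangles, the distance function should give a ratio of exactly $7/2$.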

2.5.2 Middle slice with adjacent transpositions 

We now consider $G$ with $V = \{x \in \{-1,1\}^{2n} : \sum_i x_i = 0\}$, where $x \sim y \iff \mathrm{supp}(x - y)$
consists of 2 consecutive elements. Alternately, $V$ is the set of paths in $\mathbb{Z}^2$ that move from
$(0,0)$ to $(2n,0)$ with steps of $(+1,+1)$ and $(+1,-1)$, and paths $x$ and $y$ are neighbors if $y$
can be achieved by transposing an adjacent $(+1,+1)$ and $(+1,-1)$ in $x$.

Theorem 2.8. $\mathrm{Ric}(G) \ge -1$. Further, $\lim_{n \to \infty} \mathrm{Ric}(G) = -1$.

Proof. Let $x \in V$. Let $I(x) = \{i \in \{1, \ldots, 2n - 1\} : x_i \ne x_{i+1}\}$, so $i \in I$ if and only if we
are allowed to switch segments $i$ and $i + 1$. If $i \in I(x)$, denote by $a_i x$ the vertex obtained
by making this switch. Observe $|I(x)| = \deg(x)$.

The neighbors of $a_i x$ are: $a_i(a_i x) = x$, $a_j(a_i x)$ for any $j \in I(x)$ with $|i - j| > 1$, and
$a_j(a_i x)$ for any $j \notin I(x)$ with $|i - j| = 1$ and $j \ne 0, 2n$. We calculate that
$\deg(a_i x) = \deg(x) + 2 - 2\#\{j \in I(x) : |i - j| = 1\} - 1_{i=1} - 1_{i=2n-1}$.

We observe that a neighbor of the form $a_j(a_i x)$ with $j \in I(x)$ and $|i - j| > 1$ will be identical
to $a_i(a_j x)$, and have $d(x, a_j a_i x) = 2$.
Now, for any function $f$,

$$\begin{aligned}
\frac{1}{2} \sum_{\substack{u \sim v \sim x \\ d(x,u) = 2}} \big(f(u) - 2f(v)\big)^2
&\ge \frac{1}{2} \sum_{\substack{i,j \in I \\ |i - j| > 1}} \Big[\big(f(a_i a_j x) - 2f(a_i x)\big)^2 + \big(f(a_i a_j x) - 2f(a_j x)\big)^2\Big] \\
&\ge \sum_{\substack{i,j \in I \\ |i - j| > 1}} \big(f(a_i x) - f(a_j x)\big)^2 \\
&= \sum_{i \in I(x)} \#\{j \in I(x) : |j - i| > 1\}\, f^2(a_i x) - 2\sum_{\substack{i,j \in I \\ |i - j| > 1}} f(a_i x) f(a_j x).
\end{aligned}$$

Observe that $G$ is triangle-free. We have that

$$\begin{aligned}
2\Gamma_2(f)(x)
&\ge \sum_{i \in I(x)} \#\{j \in I(x) : |j - i| > 1\}\, f^2(a_i x) - 2\sum_{\substack{i,j \in I \\ |i - j| > 1}} f(a_i x) f(a_j x) \\
&\quad + \sum_{i \in I(x)} f^2(a_i x) + 2\sum_{i,j \in I} f(a_i x) f(a_j x) \\
&\quad + \sum_{i \in I(x)} \Big(2 - \frac{2\deg(x) + 2 - 2\#\{j \in I(x) : |i - j| = 1\} - 1_{i=1} - 1_{i=2n-1}}{2}\Big) f^2(a_i x) \\
&\ge \sum_{i \in I(x)} \big(\#\{j \in I(x) : i \ne j\} + 2 - \deg(x)\big) f^2(a_i x) + 2\sum_{\substack{i,j \in I \\ |i - j| = 1}} f(a_i x) f(a_j x) \\
&= \sum_{i \in I(x)} f^2(a_i x) + 2\sum_{\substack{i,j \in I \\ |i - j| = 1}} f(a_i x) f(a_j x) \\
&\ge -\sum_{i \in I(x)} f^2(a_i x) + \sum_{\substack{i,j \in I \\ |i - j| = 1}} \big(f(a_i x) + f(a_j x)\big)^2 \ge -2\Gamma(f)(x).
\end{aligned}$$

So $\mathrm{Ric}(G) \ge -1$, where we ignore a slight dependence on $n$ in the lower order term.
Define a function with $f(+1, -1, +1, -1, \ldots) = 0$ and $f(a_i x) = f(x) - x_i$, that is, if the
switch lowers the path, $f$ decreases by 1; a switch that raises the path will increase $f$ by 1.
Using this $f$ and $x = (+1, -1, +1, -1, \ldots)$, we find that $\mathrm{Ric} \to -1$ as $n \to \infty$. □

We now calculate the curvature for the subgraph $G_+$ that is induced on the Dyck paths,
i.e., those paths that are always on or above the $x$-axis. Alternately, sequences in $\{\pm 1\}^{2n}$
with $\sum_{i=1}^{2n} x_i = 0$ and $\sum_{i=1}^{j} x_i \ge 0$ for all $j = 0, \ldots, 2n$. It is well-known that the number of
Dyck paths is the Catalan number $C_n$.
Corollary 2.9. For this subgraph $G_+$, $\mathrm{Ric}(G_+) \ge -1$. Further, $\lim_{n \to \infty} \mathrm{Ric}(G_+) = -1$.

Proof sketch. Let $x \in V$, and let

$$I(x) = \{i \in [2n - 1] : \text{a possible move is to transpose } x_i, x_{i+1}\}.$$

If $i \in I$, let $a_i x$ be the sequence obtained by transposing $x_i, x_{i+1}$.

Observe that $\deg(a_i x) \le \deg(x) + 2 - 2\#\{j \in I(x) : |i - j| = 1\} - 1_{i=1} - 1_{i=2n-1}$. Using
the same analysis as in the unrestricted problem, we may conclude that

$$2\Gamma_2(f)(x) \ge -2\Gamma(f)(x).$$

A similar test-function as above will prove that $\mathrm{Ric} \le -1 + o(1)$. We may use the same
function $f$, and take $x$ identical to the above example but with the first $-1$ and last $+1$
transposed. This will give a similar upper bound on Ric. (Observe that the neighbors and
second-neighbors of $x$ in the unrestricted graph are all Dyck paths, so the curvature at $x$
will be unchanged from the original.) □


2.6 The symmetric group $S_n$ with all transpositions

Theorem 2.10. Let $G$ be the Cayley graph on the symmetric group $S_n$ with all transpositions
as generators. Then $\mathrm{Ric}(G) = 2$.

Let us remark that in recent work [17] the authors also provided a lower bound for the
Ricci curvature of the (Cayley) graph on the symmetric group with the edge set given by
transpositions, but with a different notion of Ricci curvature, one developed by Erbar and
Maas [16]. It is easy to see that the Ricci curvature developed by Ollivier [30] gives a value
of $\kappa = 2/\binom{n}{2}$ for this problem in the setting of a Markov chain. A simple coupling argument
shows that this agrees with our result, modulo the normalizing factor between the graph
setting and the Markov chain setting.

Proof. Let $x \in S_n$. A vertex $u$ with $d(u,x) = 2$ will either be $(ijk)x$ for some distinct
$i,j,k \in [n]$ or $(ij)(kl)x$ for distinct $i,j,k,l \in [n]$.

In the first case, the vertices $v$ s.t. $(ijk)x \sim v \sim x$ are $v = (ij)x, (ik)x, (jk)x$. For
$u = (ijk)x$,

$$\begin{aligned}
\sum_{v:\, u \sim v \sim x} \big(f(u) - 2f(v)\big)^2
&= \big(f(u) - 2f((ij)x)\big)^2 + \big(f(u) - 2f((ik)x)\big)^2 + \big(f(u) - 2f((jk)x)\big)^2 \\
&\ge \frac{4}{3}\Big[\big(f((ij)x) - f((ik)x)\big)^2 + \big(f((ij)x) - f((jk)x)\big)^2 + \big(f((ik)x) - f((jk)x)\big)^2\Big].
\end{aligned}$$

In the second case, a $v$ such that $(ij)(kl)x \sim v \sim x$ is either $v = (ij)x$ or $v = (kl)x$. If
$u = (ij)(kl)x$,

$$\sum_{v:\, u \sim v \sim x} \big(f(u) - 2f(v)\big)^2 \ge 2\big(f((ij)x) - f((kl)x)\big)^2.$$
Taking a sum over all values of $u$ gives

$$\frac{1}{2} \sum_{\substack{u \sim v \sim x \\ d(u,x) = 2}} \big(f(u) - 2f(v)\big)^2 \ge \sum_{v,w \sim x} \big(f(v) - f(w)\big)^2.$$

Indeed, if $v, w$ are $v = (ij)x$ and $w = (ik)x$ for some $i,j,k$, the term $\frac{4}{3}(f(v) - f(w))^2$ is counted
twice in the sum: for $u = (ijk)x$ and $u = (ikj)x$. If $v, w$ are $v = (ij)x$ and $w = (kl)x$ for
some $i,j,k,l$, the term $2(f(v) - f(w))^2$ is counted once: for $u = (ij)(kl)x$.

Observe that $G$ is triangle-free and regular with degree $d = \binom{n}{2}$. Using this bound, we
see that

$$\begin{aligned}
2\Gamma_2(f)(x) &\ge \sum_{v,w \sim x} \big(f(v) - f(w)\big)^2 + \Big(\sum_{v \sim x} f(v)\Big)^2 + (2 - d)\sum_{v \sim x} f^2(v) \\
&= 2\sum_{v \sim x} f^2(v) = 4\Gamma(f)(x).
\end{aligned}$$

Therefore $\mathrm{Ric} \ge 2$; as $G$ is triangle-free, $\mathrm{Ric} = 2$ by Theorem 1.2. □
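The lower bound can be spot-checked on $S_4$, which has 24 vertices of degree $\binom{4}{2} = 6$. In the sketch below, permutations are stored as tuples and a neighbour is obtained by swapping the entries in two positions; this is multiplication by a transposition on the other side from the convention in the proof, which gives an isomorphic graph. The helper names, the value of $n$ and the random seed are illustrative assumptions.

```python
import random
from fractions import Fraction
from itertools import combinations, permutations

def lap(adj, f, x):
    return sum(f[y] - f[x] for y in adj[x])

def gam(adj, f, g, x):
    return Fraction(sum((f[x] - f[y]) * (g[x] - g[y]) for y in adj[x]), 2)

def gam2(adj, f, x):
    """Gamma_2(f)(x) = 1/2 * Delta Gamma(f)(x) - Gamma(f, Delta f)(x)."""
    ball = [x, *adj[x]]
    lap_f = {v: lap(adj, f, v) for v in ball}
    gam_f = {v: gam(adj, f, f, v) for v in ball}
    return lap(adj, gam_f, x) / 2 - gam(adj, f, lap_f, x)

def swap(p, i, j):
    """Return the permutation p with the entries in positions i and j exchanged."""
    q = list(p)
    q[i], q[j] = q[j], q[i]
    return tuple(q)

n = 4
adj = {p: [swap(p, i, j) for i, j in combinations(range(n), 2)]
       for p in permutations(range(n))}

rng = random.Random(5)  # fixed seed for reproducibility
worst_gap = min(
    gam2(adj, f, x) - 2 * gam(adj, f, f, x)
    for f in ({p: rng.randint(-5, 5) for p in adj} for _ in range(10))
    for x in adj)
```

A non-negative `worst_gap` is what $\mathrm{Ric} \ge 2$ predicts for every function and every vertex.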


3 Spectral gap and curvature 


Let $\lambda(G)$ denote the spectral gap of $G$; i.e., the least nonzero eigenvalue of $-\Delta$.

Theorem 3.1. Let $G$ be a graph with curvature $\mathrm{Ric} \ge K > 0$. Then $\lambda \ge K$.

A different proof of this result was given in [11].

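Proof. We may use the characterization of the spectral gap that compares the second derivative
with the first derivative (of the variance along the heat kernel) (see e.g. [28]):

$$\lambda = \inf_{f \text{ non-constant}} \frac{\mathcal{E}(-\Delta f, f)}{\mathcal{E}(f, f)},$$

so that $a \le \lambda$ if and only if, for any function $f$, we have $a \cdot \mathcal{E}(f,f) \le \mathcal{E}(-\Delta f, f)$.

By assumption, $G$ satisfies (3) with parameter $K$, i.e.,

$$\Delta\Gamma(f)(x) - 2\Gamma(f, \Delta f)(x) - 2K\,\Gamma(f)(x) \ge 0$$

for all functions $f : V \to \mathbb{R}$ and all $x \in V$. Summing the above inequality over all vertices
gives

$$\begin{aligned}
&\sum_{x} \Delta\Gamma(f)(x) - 2\sum_{x} \Gamma(\Delta f, f)(x) - 2K\sum_{x} \Gamma(f)(x) \\
&\qquad = 2\sum_{x} \big(\Delta f(x)\big)^2 - K\sum_{x}\sum_{y \sim x} \big(f(y) - f(x)\big)^2 \\
&\qquad = 2\sum_{x} \big(\Delta f(x)\big)^2 - 2K\sum_{x \sim y} \big(f(y) - f(x)\big)^2 \ge 0,
\end{aligned}$$

where in the first equality, we used the identity (1) and the fact that for any $g$, $\sum_x \Delta g(x) = 0$.
Now let $|V| = n$, and recall the Dirichlet form (with respect to the measure $\pi \equiv 1$),

$$\mathcal{E}(f,f) = \sum_{x \sim y} \big(f(x) - f(y)\big)^2,$$

and that

$$\mathcal{E}(-\Delta f, f) = \sum_{x} -\Delta f(x)\Big(\sum_{y \sim x} \big(f(x) - f(y)\big)\Big) = \sum_{x} \big(\Delta f(x)\big)^2.$$

Plugging into the above inequality gives

$$2\mathcal{E}(-\Delta f, f) - 2K\,\mathcal{E}(f,f) \ge 0,$$

and so

$$K\,\mathcal{E}(f,f) \le \mathcal{E}(-\Delta f, f),$$

resulting in $\lambda \ge K$. □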
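The final inequality of this proof is easy to test on the 3-dimensional hypercube, where the curvature is 2. The sketch below, with hypothetical helper names and a fixed random seed, checks $K\,\mathcal{E}(f,f) \le \mathcal{E}(-\Delta f, f)$ for $K = 2$ on random integer-valued functions, using $\mathcal{E}(f,g) = -\sum_x f(x)\Delta g(x)$ from identity (1) with $\pi \equiv 1$. It also verifies that $f(x) = 1 - 2x_1$ is an eigenfunction of $-\Delta$ with eigenvalue 2, so the bound $\lambda \ge K$ is attained on this graph.

```python
import random
from itertools import product

n = 3
cube = {v: [v[:i] + (1 - v[i],) + v[i + 1:] for i in range(n)]
        for v in product((0, 1), repeat=n)}

def laplacian(f):
    """(Delta f)(x) = sum_{y ~ x} (f(y) - f(x)) on the hypercube."""
    return {x: sum(f[y] - f[x] for y in cube[x]) for x in cube}

def dirichlet(f, g):
    """E(f, g) = -sum_x f(x) (Delta g)(x), i.e. identity (1) with pi = 1."""
    lap_g = laplacian(g)
    return -sum(f[x] * lap_g[x] for x in cube)

K = 2
rng = random.Random(2)  # fixed seed for reproducibility
inequality_holds = True
for _ in range(20):
    f = {x: rng.randint(-5, 5) for x in cube}
    minus_lap_f = {x: -value for x, value in laplacian(f).items()}
    inequality_holds = inequality_holds and K * dirichlet(f, f) <= dirichlet(minus_lap_f, f)

# chi(x) = 1 - 2 x_1 satisfies -Delta chi = 2 chi, so the spectral gap is at most 2.
chi = {x: 1 - 2 * x[0] for x in cube}
lap_chi = laplacian(chi)
is_eigenfunction = all(-lap_chi[x] == 2 * chi[x] for x in cube)
```

All quantities here are integers, so the comparisons are exact.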

4 Buser-type Inequalities 

The proofs in this section are a straightforward discrete version of § 5 of Ledoux’s paper [23]. 
First we derive a key gradient estimate on the heat kernel associated with a graph, which 
will then be used in deriving a Buser inequality for graphs, as mentioned in the introduction. 

4.1 Gradient estimates 

For $t \ge 0$, we write $P_t = \exp(t\Delta)$ for the heat kernel associated with the graph $G$. Then $P_t$ is
a positive definite matrix on $\mathbb{R}^V$, with $P_0$ being the identity matrix. Note that $P_t$ commutes
with $\Delta$ and with $P_s$, and that $dP_t/dt = P_t\Delta = \Delta P_t$. Finally, the matrix $P_t$ has non-negative
entries. So if $f$ has non-negative entries, then $P_t(f)$ also has non-negative entries. For a
vector $f : V \to \mathbb{R}$ we write $\|f\|_p = \big(\sum_{v} |f(v)|^p\big)^{1/p}$.

Lemma 4.1. Suppose $G$ has $\mathrm{Ric}(G) \ge K$ for some $K \in \mathbb{R}$. Then, for any $f : V \to \mathbb{R}$ and
any $0 < t \le 1/|2K|$,

$$\|f - P_t f\|_1 \le 2\sqrt{t}\,\big\|\sqrt{\Gamma(f)}\big\|_1.$$

Note that the restriction on $t$ applies only when $K$ is negative: if $K \ge 0$ then $\mathrm{Ric} \ge K$
implies $\mathrm{Ric} \ge 0$ and the lemma holds with no restriction on $t$.
Proof. The proof is in three steps.

Step 1. We first prove that

$$\Gamma(P_t f) \le e^{-2Kt} P_t(\Gamma(f)),$$

where the inequality holds pointwise on $V$ (recalling that these are real-valued functions on
$V$). To that end, define the auxiliary function $g_s = e^{-2Ks} P_s(\Gamma(P_{t-s} f))$, a function on $V$. It
is enough to show that $dg_s/ds$ is pointwise non-negative on $(0,t)$. We compute

$$\frac{dg_s}{ds} = e^{-2Ks} P_s\big[2\Gamma_2(P_{t-s} f) - 2K\,\Gamma(P_{t-s} f)\big].$$

Since $P_s$ preserves non-negativity, it is enough to prove that

$$\Gamma_2(P_{t-s} f) - K\,\Gamma(P_{t-s} f) \ge 0,$$

which is true by our assumption that $\mathrm{Ric}(G) \ge K$.

Step 2. Next we prove that

$$P_t(f^2) - (P_t f)^2 \ge \left(2\int_0^t e^{2Ks}\,ds\right) \Gamma(P_t f). \qquad (8)$$

To that end, define the auxiliary function $g_s = P_s[(P_{t-s} f)^2]$. It is enough to show that
$dg_s/ds \ge 2e^{2Ks}\,\Gamma(P_t f)$, for any $0 < s < t$. We compute, using the local identity (2)
mentioned earlier,

$$\frac{dg_s}{ds} = P_s\left[2 P_{t-s}f \cdot \Delta P_{t-s}f + 2\Gamma(P_{t-s}f)\right] + P_s\left[2 P_{t-s}f \cdot (-\Delta P_{t-s}f)\right].$$

Hence, by Step 1, for any $0 < s < t$,

$$\frac{dg_s}{ds} = 2 P_s(\Gamma(P_{t-s}f)) \ge 2e^{2Ks}\,\Gamma(P_t f),$$

which gives (8).

Denote $\alpha(t) = 2\int_0^t e^{2Ks}\,ds$. Then $\alpha(t) = (e^{2Kt} - 1)/K$ for non-zero K, and $\alpha(t) = 2t$
for $K = 0$. In both cases, $\alpha(t) \sim 2t$ for small $t > 0$. For instance, $\alpha(t) \ge t$ for $0 \le t \le 1/(2|K|)$.
Hence (8) gives, for $0 < t \le 1/(2|K|)$,

$$\max \sqrt{\Gamma(P_t f)} \le \frac{1}{\sqrt{t}} \max \sqrt{P_t(f^2)} \le \frac{1}{\sqrt{t}} \max |f|. \qquad (9)$$
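
The elementary bound $\alpha(t) \ge t$ used here, and the reason why t must stay below $1/(2|K|)$ when K is negative, can be seen numerically. The following is a small sketch. The function names and the sample values of K are chosen for illustration only.

```python
import math


def alpha(t, k):
    """alpha(t) = 2 * integral_0^t exp(2*k*s) ds, in closed form."""
    if k == 0:
        return 2.0 * t
    return math.expm1(2.0 * k * t) / k  # (e^{2kt} - 1) / k


def holds_on_range(k, samples=1000):
    """Check alpha(t) >= t on a grid of t in (0, 1/(2|k|)] (or (0, 5] if k == 0)."""
    t_max = 5.0 if k == 0 else 1.0 / (2.0 * abs(k))
    return all(alpha(t_max * j / samples, k) >= t_max * j / samples
               for j in range(1, samples + 1))


checks = {k: holds_on_range(k) for k in (-4.0, -1.0, -0.25, 0.0, 0.5, 3.0)}
print(checks)

# For K < 0, alpha(t) stays below 1/|K|, so alpha(t) >= t must fail for large t.
print(alpha(2.0, -1.0), "versus t =", 2.0)
```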

Step 3. As can be guessed by now, we begin by writing

$$P_t f - f = \int_0^t \frac{dP_s f}{ds}\,ds = \int_0^t P_s \Delta f\,ds.$$

To prove the lemma, it suffices to show that $\|P_s(\Delta f)\|_1 \le s^{-1/2} \|\sqrt{\Gamma(f)}\|_1$ (since we have
$\int_0^t s^{-1/2}\,ds = 2\sqrt{t}$). Let $\psi = \mathrm{sgn}(P_s(\Delta f))$. Then,

$$\|P_s(\Delta f)\|_1 = \sum_{x \in V} P_s(\Delta f)(x) \cdot \psi(x) = \sum_{x \in V} \Delta f(x) \cdot P_s\psi(x) = \sum_{x \in V} -\Gamma(f, P_s\psi)(x)$$

$$\le \sum_{x \in V} \sqrt{\Gamma(f)(x)}\,\sqrt{\Gamma(P_s\psi)(x)} \le \left\|\sqrt{\Gamma(f)}\right\|_1 \cdot \max_{x \in V} \sqrt{\Gamma(P_s\psi)(x)},$$

and the desired inequality follows from (9), as $\max |\psi| = 1$. □
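
As a sanity check of Lemma 4.1, and not a substitute for the proof, the sketch below compares both sides of the inequality on the cycle $C_n$. It rests on several assumptions. The cycle is an abelian Cayley graph, a class the abstract describes as having non-negative curvature, so we take $K = 0$ and impose no restriction on t. The conventions $(\Delta f)(x) = \sum_{y \sim x} (f(y) - f(x))$ and $\Gamma(f)(x) = \frac{1}{2}\sum_{y \sim x} (f(y) - f(x))^2$ are assumed, since they are not restated in this section. The heat kernel is computed from the Fourier diagonalisation of the circulant matrix $\Delta$, and all function names are illustrative.

```python
import math
import random


def cycle_heat_kernel(n, t):
    """Matrix of P_t = exp(t * Delta) on the n-cycle (n >= 3). Delta is circulant,
    so it is diagonalised by the discrete Fourier basis, with eigenvalues
    -(2 - 2 cos(2 pi k / n)) for k = 0, ..., n - 1."""
    weights = [math.exp(-t * (2.0 - 2.0 * math.cos(2.0 * math.pi * k / n)))
               for k in range(n)]
    return [[sum(w * math.cos(2.0 * math.pi * k * (x - y) / n)
                 for k, w in enumerate(weights)) / n
             for y in range(n)] for x in range(n)]


def sqrt_gamma(f):
    """Pointwise sqrt(Gamma(f)) on the cycle, with the assumed convention
    Gamma(f)(x) = (1/2) * sum over neighbours y of (f(y) - f(x))**2."""
    n = len(f)
    return [math.sqrt(0.5 * ((f[(x + 1) % n] - f[x]) ** 2
                             + (f[(x - 1) % n] - f[x]) ** 2))
            for x in range(n)]


def lemma_sides(f, t):
    """Return (||f - P_t f||_1, 2 sqrt(t) ||sqrt(Gamma(f))||_1)."""
    n = len(f)
    p = cycle_heat_kernel(n, t)
    ptf = [sum(p[x][y] * f[y] for y in range(n)) for x in range(n)]
    lhs = sum(abs(f[x] - ptf[x]) for x in range(n))
    rhs = 2.0 * math.sqrt(t) * sum(sqrt_gamma(f))
    return lhs, rhs


random.seed(0)
n = 12
indicator = [1.0 if x < 5 else 0.0 for x in range(n)]   # f = 1_A for an arc A
noisy = [random.uniform(-1.0, 1.0) for _ in range(n)]    # an arbitrary test vector
results = [(t,) + lemma_sides(f, t)
           for f in (indicator, noisy) for t in (0.01, 0.1, 1.0, 10.0)]
for t, lhs, rhs in results:
    print(f"t = {t:5.2f}   lhs = {lhs:.4f}   rhs = {rhs:.4f}")
```

The indicator vector is the case used for the isoperimetric application in the next subsection.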


4.2 Spectral gap and isoperimetry 

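Theorem 4.2. Suppose G has $\mathrm{Ric}(G) \ge K$, for some $K \in \mathbb{R}$. Denote by $\lambda > 0$ the minimal
non-zero eigenvalue of $-\Delta$. Then, for any subset $A \subset V$,

$$|\partial A| \ge \frac{1}{2} \min\left\{\sqrt{\lambda},\ \frac{\lambda}{\sqrt{2|K|}}\right\} |A| \left(1 - \frac{|A|}{|V|}\right).$$

Here, by $\partial A$, we mean the collection of all edges connecting A to its complement.

As noted in the previous lemma, the term $\lambda/\sqrt{2|K|}$ is relevant only in the case $K < 0$.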

Proof. Apply the previous lemma to $f = 1_A$. Then $\Gamma(1_A)$ is the function which associates
with each $v \in V$ the number of edges in $\partial A$ that are incident with v. Consequently, for any
$0 < t \le 1/(2|K|)$,

$$\|1_A - P_t(1_A)\|_1 \le 2\sqrt{t} \cdot |\partial A|.$$

Note that $0 \le P_t(1_A) \le 1$, hence the left-hand side may be written as follows:

$$\|1_A - P_t(1_A)\|_1 = |A| - \sum_{A} P_t(1_A) + \sum_{A^c} P_t(1_A) = 2\left(|A| - \sum_{V} 1_A \cdot P_t(1_A)\right).$$


Since $P_t$ is self-adjoint and $P_{t/2} P_{t/2} = P_t$, then,

$$(1/2)\, \|1_A - P_t(1_A)\|_1 = |A| - \|P_{t/2}(1_A)\|_2^2 = \|1_A\|_2^2 - \|P_{t/2}(1_A)\|_2^2.$$


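Let $\varphi_i$, $0 \le i \le n-1$, be the orthonormal eigenvectors of $\Delta$, and let $\lambda_i$ be the corresponding
eigenvalues of $-\Delta$. Let $1_A = \sum_{i=0}^{n-1} a_i \varphi_i$ be the spectral decomposition of $1_A$, with $\varphi_0 = 1/\sqrt{|V|}$ and
$a_0 = |A|/\sqrt{|V|}$. Then $P_{t/2}(1_A) = \sum_{i=0}^{n-1} a_i e^{-\lambda_i t/2} \varphi_i$, and hence

$$(1/2)\, \|1_A - P_t(1_A)\|_1 = \sum_{i} (1 - e^{-\lambda_i t})\, a_i^2 \ge (1 - e^{-\lambda t}) \sum_{i \ge 1} a_i^2 = (1 - e^{-\lambda t}) \left(|A| - \frac{|A|^2}{|V|}\right).$$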



To summarize, for any $0 < t \le 1/(2|K|)$,

$$|\partial A| \ge \frac{1 - e^{-\lambda t}}{\sqrt{t}}\, |A| \left(1 - \frac{|A|}{|V|}\right).$$

If $\lambda \ge 2|K|$, we select $t = 1/\lambda \le 1/(2|K|)$, and deduce the theorem (use $1 - 1/e \ge 1/2$). If
$\lambda \le 2|K|$, we take the maximal possible value, $t = 1/(2|K|)$. Then $1 - e^{-\lambda/(2|K|)} \ge \lambda/(4|K|)$,
and the theorem follows. □


Corollary 4.3. Suppose a graph G has $\mathrm{Ric}(G) \ge K$, for some $K \ge 0$. Then

$$h \ge \frac{1}{4}\sqrt{\lambda}.$$

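Proof. As already explained, when $K \ge 0$ we may ignore the term $\lambda/\sqrt{2|K|}$ in the minimum
in Theorem 4.2, and then the theorem gives

$$\frac{|\partial A| \cdot |V|}{|A| \cdot |A^c|} \ge \frac{1}{2}\sqrt{\lambda},$$

and so we have

$$h \ge \frac{1}{4}\sqrt{\lambda}.$$ □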
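
To see Theorem 4.2 and Corollary 4.3 in a concrete case, one can brute-force the 3-dimensional hypercube $Q_3$. The sketch below rests on assumptions that are not restated in this section. $Q_3$ is an abelian Cayley graph, so we take $K = 0$ and the minimum in Theorem 4.2 reduces to $\sqrt{\lambda}$. The smallest non-zero eigenvalue of $-\Delta$ on a hypercube is $\lambda = 2$ for the unnormalized Laplacian. The Cheeger constant is taken to be $h = \min_{0 < |A| \le |V|/2} |\partial A|/|A|$. If h is normalised differently, the comparison changes by the corresponding factor.

```python
import math

DIM = 3
NUM_VERTICES = 2 ** DIM
# Vertices are bitmasks; two vertices are adjacent when they differ in one bit.
EDGES = [(x, x ^ (1 << i)) for x in range(NUM_VERTICES) for i in range(DIM)
         if x < (x ^ (1 << i))]
LAMBDA = 2.0  # assumed spectral gap of -Delta on the hypercube (unnormalized)


def boundary_size(subset_mask):
    """Number of edges with exactly one endpoint in the subset encoded by the mask."""
    return sum(((subset_mask >> u) & 1) != ((subset_mask >> v) & 1)
               for u, v in EDGES)


theorem_holds = True
cheeger = math.inf
for mask in range(1, 2 ** NUM_VERTICES - 1):   # all non-empty proper subsets A
    size = bin(mask).count("1")
    boundary = boundary_size(mask)
    # Theorem 4.2 with K = 0: |dA| >= (1/2) sqrt(lambda) |A| (1 - |A|/|V|)
    bound = 0.5 * math.sqrt(LAMBDA) * size * (1.0 - size / NUM_VERTICES)
    theorem_holds = theorem_holds and boundary >= bound
    if size <= NUM_VERTICES // 2:
        cheeger = min(cheeger, boundary / size)

print(theorem_holds, cheeger, math.sqrt(LAMBDA) / 4.0)
```

Enumerating all subsets is feasible only for very small graphs. The point is to make the quantities in the statement tangible, not to test the bound at scale.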

4.3 Logarithmic Sobolev constant and isoperimetry 

We now prove an analogue of Theorem 5.3 from [23], relating the log-Sobolev constant $\rho$
to an isoperimetric quantity. Consider the hypercontractive formulation of the log-Sobolev
constant (see e.g., [20],[15]): namely, define $\rho$ to be the greatest value so that whenever
$1 < r < q < \infty$ and $\sqrt{\frac{q-1}{r-1}} \le e^{\rho t}$, then

$$n^{-1/q}\, \|P_t f\|_q \le n^{-1/r}\, \|f\|_r.$$

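Theorem 4.4. Suppose G has $\mathrm{Ric}(G) \ge K$ for some value $K \in \mathbb{R}$. Then for any subset
$A \subset V$ with $|A| \le |V|/2 = n/2$,

$$|\partial A| \ge \frac{1}{16} \min\left(\sqrt{\rho},\ \frac{\rho}{\sqrt{2|K|}}\right) |A| \left(\log \frac{n}{|A|}\right)^{1/2}.$$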

Proof. As in the proof of Theorem 4.2 above, we can observe that

$$\frac{\sqrt{t}\, |\partial A|}{n} \ge \frac{|A|}{n} - \frac{\|P_{t/2} 1_A\|_2^2}{n},$$

if $0 < t \le 1/(2|K|)$. Using the hypercontractivity property with $q = 2$ and $r = 1 + e^{-2\rho t}$
gives that

$$\frac{\|P_{t/2} 1_A\|_2^2}{n} \le \frac{\|1_A\|_r^2}{n^{2/r}} = \left(\frac{|A|}{n}\right)^{2/r}.$$

Hence,

$$\frac{\sqrt{t}\, |\partial A|}{n} \ge \frac{|A|}{n} - \frac{\|P_{t/2} 1_A\|_2^2}{n} \ge \frac{|A|}{n} - \left(\frac{|A|}{n}\right)^{2/r}.$$


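As $2/r \ge 1 + \rho t/4$ whenever $0 \le \rho t \le 1$, and $|A|/n \le 1$,

$$\frac{\sqrt{t}\, |\partial A|}{n} \ge \frac{|A|}{n} - \left(\frac{|A|}{n}\right)^{1 + \rho t/4} = \frac{|A|}{n} \left(1 - \left(\frac{|A|}{n}\right)^{\rho t/4}\right). \qquad (10)$$

Let $t_0 = \min(1/(2|K|),\ 1/\rho)$. If $|A|/n \le e^{-4}$, set $t = \frac{4 t_0}{\log(n/|A|)}$.
Using this value of t in (10), we find

$$\frac{|\partial A|}{n} \ge \frac{1}{\sqrt{t}}\, \frac{|A|}{n} \left(1 - e^{-\rho t_0}\right) = \frac{1}{2\sqrt{t_0}}\, \frac{|A|}{n} \left(1 - e^{-\rho t_0}\right) \left(\log \frac{n}{|A|}\right)^{1/2} \ge \frac{1}{4}\, \rho \sqrt{t_0}\, \frac{|A|}{n} \left(\log \frac{n}{|A|}\right)^{1/2}.$$

On the other hand, if $e^{-4} \le |A|/n \le 1/2$, use $t = t_0$ in (10) to find:

$$\frac{|\partial A|}{n} \ge \frac{1}{\sqrt{t_0}}\, \frac{|A|}{n} \left(1 - \left(\frac{|A|}{n}\right)^{\rho t_0/4}\right) \ge \frac{\rho \sqrt{t_0}}{8}\, \frac{|A|}{n} \ge \frac{1}{16}\, \rho \sqrt{\min\left(\frac{1}{2|K|}, \frac{1}{\rho}\right)}\, \frac{|A|}{n} \left(\log \frac{n}{|A|}\right)^{1/2},$$

where, for the second inequality, we use $1 - 2^{-x} \ge x/2$, if $0 \le x \le 1$. Hence,

$$|\partial A| \ge \frac{1}{16} \min\left(\sqrt{\rho},\ \frac{\rho}{\sqrt{2|K|}}\right) |A| \left(\log \frac{n}{|A|}\right)^{1/2},$$

proving the theorem. □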
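
The proof relies on a few elementary facts that are easy to spot-check on a grid. These are the bound $2/r \ge 1 + \rho t/4$ for the value of r chosen in the proof, the bound $1 - 2^{-x} \ge x/2$ on $[0, 1]$, and the identity $\rho \sqrt{t_0} = \min(\sqrt{\rho},\ \rho/\sqrt{2|K|})$ that turns the intermediate constants into the one in the statement. The sketch below also checks $1 - e^{-x} \ge x/2$, which appears to be what the case $|A|/n \le e^{-4}$ needs in order to bound $1 - e^{-\rho t_0}$ from below. The variable and function names are illustrative, and the case $K = 0$ is read as "no curvature constraint on $t_0$".

```python
import math

GRID = [j / 1000.0 for j in range(0, 1001)]   # x in [0, 1]
EPS = 1e-12                                    # slack for floating-point equality cases

# 2/r >= 1 + x/4 with r = 1 + exp(-2x), where x stands for rho * t in [0, 1]
ratio_ok = all(2.0 / (1.0 + math.exp(-2.0 * x)) >= 1.0 + x / 4.0 - EPS for x in GRID)
# 1 - exp(-x) >= x/2 on [0, 1]
exp_ok = all(1.0 - math.exp(-x) >= x / 2.0 - EPS for x in GRID)
# 1 - 2**(-x) >= x/2 on [0, 1]
pow_ok = all(1.0 - 2.0 ** (-x) >= x / 2.0 - EPS for x in GRID)


def rho_sqrt_t0(rho, k):
    """rho * sqrt(t_0) with t_0 = min(1/(2|K|), 1/rho); for K = 0, t_0 = 1/rho."""
    t0 = 1.0 / rho if k == 0 else min(1.0 / (2.0 * abs(k)), 1.0 / rho)
    return rho * math.sqrt(t0)


def min_form(rho, k):
    """min(sqrt(rho), rho / sqrt(2|K|)); for K = 0 only sqrt(rho) remains."""
    if k == 0:
        return math.sqrt(rho)
    return min(math.sqrt(rho), rho / math.sqrt(2.0 * abs(k)))


identity_ok = all(math.isclose(rho_sqrt_t0(rho, k), min_form(rho, k), rel_tol=1e-12)
                  for rho in (0.1, 1.0, 7.5) for k in (-3.0, -0.2, 0.0, 1.5))
print(ratio_ok, exp_ok, pow_ok, identity_ok)
```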

The optimality of the above theorem (in terms of the dependence on the parameters
involved) remains open at this time; in particular, we do not have tight examples. It is also
natural to ask if the bound $\rho \ge K$ holds when $\mathrm{Ric} \ge K > 0$, similar to the bound on $\lambda$
in Theorem 3.1. In general this is not true: consider the complete graph on n vertices. We
have seen that $\mathrm{Ric} = 1 + \frac{n}{2}$, and it is easy to see (by considering the characteristic function
of a set as a test function), and is also well known, that $\rho = O\left(\frac{n}{\log n}\right)$ (see e.g., [28]).

It is however true that under a different notion of discrete curvature for reversible Markov
chains, one developed by Erbar and Maas, the so-called modified logarithmic Sobolev constant,
$\rho_0$, can be lower bounded by the curvature, see [16]. Thus it is certainly interesting to
explore whether an analog of Theorem 3.1 is true with $\rho_0$ in place of $\lambda$; recall here that $\rho_0$
captures the rate of decay of relative entropy of the Markov chain, relative to the equilibrium
distribution, while $\rho$ captures the hypercontractivity property of the Markov kernel (see [28]
for additional information).